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SEQUENCE LISTING 


<110.> Stomp, Anne-Marie 
Dickey, Lynn 
Gasdaska, John 


<120^ Expression of Biologically Active 
Polypeptides in Duckweed 

<:130> 40989/237225 


<15C)-> US 60/293, 330 
<151> 2001-05-23 

<150 ■> US 60/221, 705 
<151 * 2000-07-31 

<160 - * 

1 "7 0 • LastSEO for Windows Version 4.0 

*:2rr- l 
<211 ■ : 54 
<2 12> DMA 
vi!3 ■ 2ea mays 


ratoaagtgc auaggcccgc ettgtttctc ctctgtctct tgatctgact aatcttggtt 60 

..at ga t c gt tuagtaattt tggggaaagc ttcgt-:ca:a gttttttttt cgatgaacag 120 

..googcugtg gogctgatct tgta:g::a: cctgcaat::g tggtgaactt acgtctttta 1 

vatcctrcac :a:ca:gaaa agactagtaa cetttctcga tgtaacatcg tc:agcactg 2- 

rlattarcgt gvggtc^atc egacagcctg gctgaacaza tcatacgata ctgagcaaag 3 1 

r.ctdtrttc ectgtt::ttt aatgaaagac gtcatttt-a tcagtatgat c:aagaatg: 3 

:gcaacrtgc aaggaggcgt t:c:::-:tt: gaatt*:aa::t aactcgttga gcggccecgt 1. 

:tctcggacg taaggc::ttt gctgctccac acatgtccat tcgaatttta ccgtgtttag 480 

:aagqg rgaa augtttgcat cttgatgatt taget *:ga zt atgcgattgc tc ceetggac 54 0 

:cgtgcagct gogg 554 


1 ■ :• 4 93 
•:ii:.- DMA 

■21 5 - Artificial Sequence 


__3.- Du^-cwe^d codon optimized nucleotide sequence 
coding human alpha-23 interferon 

1^1:- CDS 

i_i.- 1 ) . . . ,4 98 ) 


-ja : ::c cz-z :ag acc cue age etc gg ? ::c cgc ^gc acc czc atg 
Asp Leu Pro Gin Thr His Ser Leu Gly Ser Arg Arg Thr Leu Met 
5 10 15 


1 


ctg ctg gcg cag atg cgc cgc ate teg etc ttc age tgc ctg aag gac 96 
Leu Leu Ala Gin Met Arg Arg lie Ser Leu Phe Ser Cys Leu Lys Asp 
20 25 30 

cgc cac gac ttc ggc ttc ccg cag gag gag ttc ggc aac cag ttc cag 144 
Arg His Asp Phe Gly Phe Pro Gin Glu Glu Phe Gly Asn Gin Phe Gin 
35 4 0 4 5 

aag gec gag acg ate ccc gtg etc cac gag atg ate eag cag ate ttc 192 
Lys Ala Glu Thr He Pro Val Leu His Glu Met lie Gin Gin He Phe 
50 55 60 

aae ctg ttc age ace aag gac age teg gec gee tgg gac gag ace etg 24 0 
Asn Leu Phe Ser Thr Lys Asp Ser Ser ALa Ala Trp Asp Glu Thr Leu 
65 70 75 80 

etc gae aag ttc tac ace gag etg tae eag cag etc aae gac ctg gag 283 
Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gin Gin Leu Asn Asp Leu Glu 
3 5 9 0 95 

geg tg- gtg ate eag ggg gtt ggg gtt aeg gag aeg eeg etg atg aag 336 
Ala Cys Val He Gin GLy Val Gly Val Tnr Glu Thr Pro Leu Met Lys 
100 105 110 

gag gae age ate etc gee gtg ege aag :a: ttc eag ego ate acg et- 38 4 
Glu Asp Ser He Leu Ala Val Arg Lys Tyr Phe Gin Arg He Thr Leu 
1 115 120 125 

tae otc aag gag aag aag ta: ajz cog zjz gee tgg gag gte gtt ege 4 32 
Tyr Leu Lys Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val Arg 
130 135 140 

gee gag at z atg eg:: tee ttc ag- etg age aee aac etc eag gag aqz 480 
/Ma Glu lie Met Arg Ser Phe Ser Leu Ser Tnr Asn Leu Gin Glu Ser 
14 5 150 155 160 

etc ege tec aag gag taa 4 93 

Leu Arg Ser Lys Glu * 
165 


-.■211.. :M5 
M:l. • DNA 

-■'2 1: * Ory za sat i va 

- 4 0 0 • 3 

aocatgeagg t:ctgaa:ac gatggtcaac aagcaettee tetcectgtc cgtccteatc >~0 
gt cote "teg ggctgagaag caacct:a:c geegge --*6 

t/:i; • 4 
t.:ll • 1.88 

- :2 1 :■ * Homo sap lens 


• ; DO • 4 

[•■let Ala Leu Thr Phe Ala Leu Lea Val Ala Leu 

9 


Leu Val Leu Ser Cys 


15 10 15 

Lys Ser Ser Cys Ser Val Gly Cys Asp Leu Pro Gin Thr His Ser Leu 

20 25 30 

Gly Ser Arg Arg Thr Leu Met Leu Leu Ala Gin Met Arg Arg lie Ser 

35 40 45 

Leu Phe Ser Cys Leu Lys Asp Arg His Asp Phe Gly Phe Pro Gin Glu 

50 55 60 

Glu Phe Gly Asn Gin Phe Gin Lys Ala Glu Thr He Pro Val Leu His 
65 70 75 80 

Glu Met He Gin Gin He Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser 

85 90 95 

Ala Ala Trp Asp Glu Thr Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr 

100 105 110 

Gin Gin Leu Asn Asp Leu Glu Ala Cys Val He Gin Gly Val Gly Val 

115 120 125 

Thr Glu Thr Pro Leu Met Lys Glu Asp Ser lie Leu Ala Val Arg Lys 

130 135 140 

Tyr Phe Gin Arg lie Thr Leu Tyr Leu Lys Glu Lys Lys Tyr Ser Pro 
145 150 155 160 

Cys Ala Trp Glu Val Val Arg Ala Glu He Met Arg Ser Phe Ser Leu 

165 170 175 

Sor Thr Asn Leu Gin Glu Ser Leu Arg Ser Lys Glu 
13 0 18 5 


<210> 5 

<;:ii> 165 

<J12*- PRT 

oil 3-- Homo sapiens 
o400 ■ 5 

Cys Asp* Leu Pro Gin Thr His Ser Leu Gly Ser Arg Arg Thr Leu Met 

1 5 10 15 

L>ju Leu Ala Gin Met Arg Arg He Ser Leu Phe Ser Cys Leu Lys Asp 

20 25 30 

Arg His Asp Phe Gly Phe Pro Gin Glu Glu Phe Gly Asn Gin Phe Gin 

3 5 4 0 4 5 

Lys Ala Glu Thr He Pro Val Leu His Glu Met He Gin Gin He Phe 

[0 5 5 60 

Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr Leu 
6= 7 0 7 5 3 0 

Lou Asp' Lys Phe Tyr Thr Glu Leu Tyr Gin Gin Leu Asn Asp Leu Glu 

35 K- 95 

A.', a Cys Val He Gin Gly Val GLy Val Thr Glu Thr Pro Leu Met Lys 

:.oo 10 5 no 

Glu Asp Ser He Leu Ala Val Arg Lys Tyr Phe Gin Arg He Thr Leu 

115 120 125 

Tyr Leu Lys Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val Arg 

".3C 135 140 

A', a Glu He Met Arg Ser Phe Ser Leu Ser Thr Asn Leu Gin Glu Ser 
145 150 155 160 

Leu Arg Ser Lys Glu 
165 


:..10 
:111 


J) 


<212> PRT 

<213> Oryza sativa 
<400> 6 

Met Gin Val Leu Asn Thr Met Val Asn Lys His Phe Leu Ser Leu Ser 

15 10 15 

Val Leu lie Val Leu Leu Gly Leu Ser Ser Asn Leu Thr Ala Gly 

20 25 30 


<210> 7 
<211> 31 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Modified rice alpha-amylase signal peptide 
<4 00> 7 

Met Gin Val Leu Asn Thr Met Val Asn Lys His Phe Leu Ser Leu Ser 

15 10 15 

Val Leu lie Val Leu Thr Val Leu Ser Ser Asn Leu Thr Ala Gly 
20 25 30 


<:110;> 8 
<i Ii:- 21 
<:212:> PRT 

<'213> Arabidopsis thaliana 
<:400> 8 

Met Lys Thr Asn Leu Phe Leu Phe Leu lie Phe Ser Leu Leu Leu Ser 

15 10 15 

Leu Ser Ser Ala Glu 
20 


4 


